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Abstract. We propose a measure of shape which is appropriate for the study of 
a complicated geometric structure, defined using the topology of neighborhoods of 
the structure. One aspect of this measure gives a new notion of fractal dimension. 
We demonstrate the utility and computability of this measure by applying it to 
branched polymers, Brownian trees, and self-avoiding random walks. 



1. Introduction. 

1.1. Complicated Geometric Structures. We consider geometric structures S 
in an ambient m-dimensional Euclidean. Many such structures of physical interest 
have great local complexity, with features on a wide range of length scales. Typically, 
these structures are generated by a probabilistic process. Fig. [T] shows the dimension 
m = 2 versions of the examples we will consider in detail in this paper. 
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(a) Branched Polymer (b) Brownian tree (c) Self-avoiding walk 

Figure 1. Three examples of polymers constructed by different processes 
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The dimension m = 3 versions of these structures are well known models for inter- 
esting physical systems. For example, a branched polymer is a model for a large 
branched polymer molecule, and a self-avoiding (random) walk is a model for long 
chain polymer. A Brownian tree is a model for soot deposition. These structures 
are also represent universality classes of interest to mathematicians. For example, 
m = 2 self-avoiding walk is conjectured to be SLE 8/3 (where SLE means Schramm 
Loewner evolution). 

In this paper, we propose a mathematical measure of shape suitable for the study of 
such complicated structures S. 




Figure 2. Scatter plot of feature points for a branched polymer, m = 
2, i = 1. 



1.2. Distributions of P.H. Points. We fix an integer i, < % < m, which we call 
the degree. Then the geometric structure S gives rise to a scatter plot of points in the 



plane, as in Fig. 2]^ We call these points P.H. points, where P.H. stands for persistent 
homology. The reason for the terminology will be described later on. Each P.H. 
point represents a particular kind of 2-dimensional "feature" of S. If the dimension 
m is 2 as in Fig. [T] and the degree i is 1, this feature is intuitively a "gulf" largely 
surrounded by S, as can be seen in Fig. |3j The x-coordinate of the feature point 
representing a given gulf measures its size, and the ^/-coordinate measures what we 
call its aspect: how completely the gulf is surrounded by S. Visibly in Fig. [TJ there 
are many such gulfs of different sizes and aspects, which is reflected in the many P.H. 
points in Fig. [2j 



1 The sharp cutoff on the left of the scatter plot is due to the geometry of small configurations of 
balls, and is not of interest to us. 
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The distribution of the P.H. points in the plane is the measure of shape of S that 
we study in this paper. We can define interesting other geometric quantities and 
properties of S in terms of this distribution. 

1.3. P.H. Dimension and P.H. Self-Similarity. When studying fractal sets 5* 
such as the sets mentioned in §1.1| above, the notions of Hausdorff dimension and 
self-similarity are traditionally useful. These notions have analogues defined in terms 
of the distribution of P.H. points. 

We define the P.H. dimension of S by the rate of growth of the density of P.H. points 
with a given x coordinate as x decreases to (see The P.H. dimension is a new 
kind of fractal dimension of S. A computation shows that in some cases, the P.H. 
dimension agrees with the Hausdorff dimension of S. 

We say that S is P.H. self-similar if the distribution of points in the plane obtained 
by rescaling the x coordinate is a scalar multiple of the original distribution of points 
(for small enough x). Ordinary self-similarity is related to Hausdorff dimension. 
Similarly. P.H. self-similarity determines P.H. dimension. 

1.4. The Topology of Neighborhoods of S. The P.H. points of S are defined 
and computed using the topology of neighborhoods of S in the ambient Euclidean 
space. For any positive number e, let S e be the e-neighborhood of S, i.e. S e is the 
enlargement of S that includes all points of distance at most e from S. As e grows, 
the topology of S t changes. We use these changes to define the P.H. points. 

For example, if m = 2 and i — 1, each P.H. point corresponds to a "gulf" in S. As 
e grows, at some time when e = b, a topological change in S t will take place: the 
mouth of the "gulf" fills in so that it becomes a "lake" fully enclosed by S t . At a 
later time when e = d, the "lake" is filled in. The number b is the value of e when 
the lake is born and d is the value when the lake dies. The coordinates x and y of 
the P.H. point are defined by the formulas x = (d + b)/2 and y = arcsec (d/b). 

The general definition of the P.H. points uses persistent homology. The "lake" in the 
example above is replaced by a generator of the 2-dimensional persistent homology 
of the family S e which is born at e = b and dies at e = d. The coordinates of the P.H. 
point are determined by the same formulas (see Since many readers will not be 
at home with homology, we give alternative interpretations of b and d in terms of 
what kinds of shapes S can "trap"). 

1.5. Probabilistic Distributions of P.H. Points. When the structure S is gen- 
erated by a probabilistic process as in the three examples in §!.![ the P.H. points are 
distributed according to a density function f(x,y) determined by that probabilistic 



4 ROBERT MACPHERSON AND BENJAMIN SCHWEINHART 

process. Then the P.H. dimension of S is given by the growth rate of f(x,y) as x 
decreases to zero. 

We define the process to be P.H. statistically self-similar if the function f(x,y) has 
the form 

f(x,y)^x-^.g(y) 
for small enough x. The number d will be the P.H. dimension. 

1.6. Computer Calculations. In most cases, the density function / cannot be 
calculated in closed form. However, it can be estimated by creating some sample 
structures S, and then calculating the scatter plot of P.H. points for each structure 
S. This is what we do in this paper. 

We do detailed computer computations for the three geometric structures of Fig. [T] 
In addition to their physical interest, we chose these structures because recently de- 
veloped algorithms make computer construction of these structures tractable. Also, 
these structures are topologically trivial, so the fact that we can obtain useful infor- 
mation about them with topological methods is perhaps surprising. The definition 
of these structures, together with a description of the algorithms to compute them 
is in Q and information about the algorithms used to computer their P.H. points is 
infj 

1.7. Computational Results. The results of our computations are in £[6] They 
show that 

• In cases where the structure S has is known or believed to be statistically self- 
similar with Hausdorff dimension d, we find that it is also P.H. statistically 
self-similar with P.H. dimension d (for all degrees i). 

• In cases where the structure S fails to be statistically self-similar, it also fails 
to be P.H. statistically self-similar. 

• Interesting differences in the statistical shape of structures S that are unre- 
lated to fractal dimensions can be detected by the y (or aspect) dependency 
of f(x,y). 

We conclude that the study of the P.H. points of a complex geometric structure 
provides insight in these cases, and we hope that it will have broad applications in 
the future. 
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2. Constructing the P.H. Points. 



We consider geometric structures S that are subsets of m-dimensional Euclidean 
space M m . For a number e > 0, let S e denote the e-neighborhood of S, i.e. the set of 
all points in M. m that are of distance at most e from some point of S. It is useful to 
think of e as time, and to think of the system of neighborhoods S e as growing over 
time. 



2.1. The Definition Using Persistent Homology. Readers unfamiliar with ho- 



mology should skip this formal definition, and go directly to §2.2| and §2. 3| where the 
idea is described more intuitively. For an integer i with < % < n, Hi(S t , R) denotes 
the degree % reduced homology group with real coefficients, and is a real vector space 
measuring the topology of the e-neighborhood S e . As e varies, we get a family J 7 
of vector spaces, one for each value of e. The fundamental result of persistent ho- 
mology [14J is that the family J 7 is a sum of extremely simple families Tj each of 
which represents a one dimensional vector space that is born at time e = bj and dies 
at time e = dj^ Our scatter plot of P.H. points has one P.H. point for each simple 
family Tj are our features with coordinates (x, y) where 

/ bj + dj\ ( dj 

(2.1) the size x = — J a e as P ec ^ V = arcsec yj^- 

If S is connected (as in the three examples in Fig. fij, then H (S e ) vanishes for all e, 
so there are no P.H. points for i — 0. If S has diameter 5, i.e. the distance between 
any two points of S is < 5, then the x coordinate of any P.H. point is < 5 because 
Hi(S e ) vanishes for e > S. 

2.2. An Example. Suppose that S is an arc of a circle of radius r in the plane. 
Suppose that the angle of the arc is n + 29 (so its length is r(n + 29)). Now we can 
draw the e-neighborhoods S e for various e, as in Figure [3] 

We see that the topology of S e doesn't change until e = rcos(#), at which time it 
develops a hole. This hole persists until e = r, when the hole if filled in. The hole 
gives rise to an element of Hi(S £ ), which is one of our "features" Tj for i — 1. It is 
born at time e = r cos(#) = bj and it dies at time e = r = dj. It gives rise to a P.H. 
point (x,y), where 

fbj + dA fr + rcos9\ ( tL\ 

x = — J = y 2 J a ^ = arcsec yj^ J = 



2 This classification holds for any nice subset of Euclidean space and, in particular, for the geo- 
metric structures we consider here. 
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(a) e = (b) < e < cos(0) (c) e = cos(0) (d) cos(0) < e < 1 (e) e = 1 
Figure 3. Neighborhoods of a circle with an arc removed (r=l) 



As in { 1.2 , the circular arc is like a gulf. The P.H. point's x-coordinate is proportional 
to r, so it is a measure of the size of the gulf. Its y-coordinate or aspect is 9 which 
measures the or how completely the gulf is surrounded by S. For example, when 
9 takes it maximum value of ir/2, the gulf is completely surrounded. This example 
easily generalizes to the two-sphere in IR 3 with a disk of angular size a removed. 



2.3. Interpretation by Objects in Cages. There is an alternative interpretation 
of P.H. points in terms of bodies of various types trapped or caged by S. 

Consider a solid structure S fixed in Euclidean 3-space and a mobile spherical ball B 
of variable radius e that cannot pass through S. The ball could be trapped or caged 
by a gulf or hole of S. If the ball expands, at some point e = dj the ball will no longer 
fit into its cage, so the cage dies for e-balls as e grows beyond dj. If e decreases so 
the ball shrinks, at some point e = bj it can escape it cage, either into the exterior 
or into another cage, so the cage is born only when e grows beyond bj. The numbers 
bj and dj coincide with the birth and death values of a persistent homology class for 
degree % = 2 (and m = 3). The reason is Alexander duality. As always, the P.H. 



point (x,y) corresponding to this cage is computed by equation (2.1). 



In dimension m = 2, for degree % = 1 the features for S can be similarly interpreted 
as cages trapping a disk of radius e. When e < bj the disk can escape the cage, and 
when e > dj the disk won't fit in the cage. 

For any i and m, Alexander Duality yields an interpretation of Hi(S e ) in terms of 
e-neighborhoods of mobile surfaces of dimension m — i — 1 trapped by S. For example, 
if i = 1 and m = 3, instead of trapping balls or radius e, we are trapping snakes of 
radius e (i.e. the e-neighborhood of a curve). We note that if S is a physical polymer, 
its ability to trap particles of size e or tubes of radius e is of physical interest. 
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3. Dimension and Self-Similarity. 

In this section, we introduce three notions that are derived from the scatter plot 
of P.H. points: P.H. dimension and P.H. self-similarity, and P.H. statistical self- 
similarity. These ideas are particularly useful when S is fractal. They are P.H. 
analogues of Hausdorff dimension and ordinary self-similarity. 

3.1. P.H. Dimension and P.H. Self-Similarity. Choose a degree i and a subin- 
terval / of the interval from to tt/2. (If no interval is specified, we take I to be the 
whole interval from to tt/2.) Let F(x) be the number of P.H. points {xj,yj ) oiS 
such that Xj > x and yj is in the interval /j^j Since F is essentially an integral from 
the right with respect to x, F is a nowhere increasing function. 

3.1.1. Definition of P.H. Dimension. We say that S has P.H. dimension d (for the 
data i and I) if 

(3.1) for all c < d, \im x c F(x) = oo and 

i->0 

(3.2) for all c> d, \imx c F(x) = 

x->0 

i.e. if F(x) grows roughly like decreases to 0. 

3.1.2. Definition of P.H. Self- Similarity. Let p be a number, < p < 1, and let I 
be an integer, i > 0. We say that S is P.H. self-similar with similarity ratio p and 
replication number I if for small enough x, the number of P.H. points with coordinates 
{px,y) is i times the number of P.H. points with coordinates {x,y). (To rule out 
trivial cases, we assume S has P.H. points for arbitrarily small x coordinate.) 

3.2. The Relation between P.H. Dimension and P.H. Self-Similarity. The 

following proposition is an analogue of the relationship between self-similarity and 
Hausdorff dimension. 

3.2.1. Proposition. If S is P.H. self-similar with similarity ratio p and replication 
number £, then its P.H. dimension is 

d _ log(l) _ log I 
logO" 1 ) logp 

(To rule out trivial cases, we assume S has P.H. points for arbitrarily small x coor- 
dinate, whose y coordinate is in J.) 



For the applications discussed in this paper, we always take / to be the whole interval from 
to 7r/2. However, we believe that the choice of an interval / will be important in other applications. 
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3.2.2. Proof. Suppose that x < c is "small enough x" in the definition of P.H. self- 
similar. It is enough to show that for x < c, A-y + Bix~ d < F{x) < A 2 + B 2 x~ d for 
some Ai, B^ and for d = — log£/ log p. 

For positive integers k, 

F{p k c) = F(c) + £ + £ 2 + --- + £ k = A + B{p k )' d 

where A = F(c) + £/{l - £), and B = £/{l - £). Now suppose p k+1 c < x < p k c. 
Then 

F(x) > F{p k c) = A + B{p k c)- d >A + B{x/p)- d = A + Bp d X - d = A ± + B lX - d 
F(x) < F(p k+1 c) = A + B(p k+1 c)- d <A + B(xp)- d = A + Bp- d x- d = A 2 + B 2 x~ d 

3.3. Relation between P.H. dimension and Hausdorff dimension. The for- 
mula d = — log£/logp shows that P.H. dimension bears the same relation to P.H. 
self-similarity that Hausdorff dimension bears to ordinary self-similarity. Neverthe- 
less, P.H. dimension and Hausdorff dimension in general measure different things. 
Here are some examples: 

3.3.1. Hausdorff dimension can exceed P.H. dimension, rf-dimensional Euclidean space 
has Hausdorff dimension d but it has P.H. dimension (since the reduced homology 
of Euclidean space vanishes, so F(x) = for any % and I). 

This example highlights a difference: P.H. dimension always measures complexity, 
whereas Hausdorff dimension measures some combination of complexity and topo- 
logical dimension. 

3.3.2. P.H. dimension can exceed Hausdorff dimension . For any p and £, let the set 
S Pi t be the set of real numbers 

= xi,o, £1,1, • • • , %i,e — ^2,0? x 2,i, ■ ■ ■ 1 %2,e 2 — ^3,0? ^3,1 • • • > x 3,e 3 = x 4,o, ■ ■ ■ 

where Xkj+i — Xk,j = 2p k . The set S Pt e has P.H. points only for i — 0, where the points 
are all of the form (p k , ir/2) which occurs £ k times for each k = 1, 2, . . . (representing 
a degree persistent homology class that is born at e = and dies at e = 2p k ). So it 
is P.H. self-similar with similarity ratio p and replication number £. Therefore, it has 
P.H. dimension — log£/logp if % = and / contains ir/2. However, it has Hausdorff 
dimension and it is not self-similar in the usual sense. 

3.4. An Example that is Both Self-Similar and P.H. Self-Similar. We present 
a fractal S in the plane that is interesting because its P.H. points can be calculated 
exactly, and its Hausdorff dimension agrees with its P.H. dimension. 
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3.4.1. Construction of S. The fractal S is depicted with several its e-neighborhoods 
in Figure |4j It is a generalized version of the Sierpinski triangle. To construct it, 
let S 1 be an equilateral triangle with sides of length one and vertices (0,0), (1,0), 
and (1/2, a/3/2). Let fi, f2, and / 3 be contractions of the plane with shared factor 
p < 1/2 that fix (0,0), (1,0), and (1/2,^/2), respectively. Let S k = AOS* -1 ) U 
f 2 {S k - 1 ) U f 3 (S k - 1 ) for k > 1. Then S is defined to be n k S k . If p 
Sierpinski triangle. 



1/2, S is the 



A A 

AA AA 

A A A A A A 

AAAA AAAA AAAA 

A A A A A Al 

A A AA AA AA 

A A A A AAAA AAAA 

AAAA AAAA AAAA AAAA AAAA AAAA 

AA A A AA AA A AA 

A A A A A A A A A A A A 

AAAA AAAA AAAA AAAA AA A AAAA 

A A A A A A AAAA 

A.'. AA AA AA AA AA AA AA AA AA AA AA 

A A A A AAAA AAAA AAAA AAAA AAAA 

AAAA AAAA AAAA AAAA AAAA AAAA AAAA AAAA AA AA AAAA AAAA AAAA 

x = 0.0 x = r 4 (l-2r)/2 x = r 3 (l-2r)/2 

A A 
AA AA 

AAAA 
jtkjtk aaaa AA AA 

x = r 2 (l-2r)/2 x = r 1 (l-2r)/2 x = r°(l-2r)/2 

Figure 4. A self-similar fractal and its e-neighborhoods 




3.4.2. Proposition. S is P.H. self-similar with similarity ratio p and replication num- 
ber 3. Therefore its P.H. dimension is — log 3/ log p (provided that either i — 1 and 
the interval / contains the number y defined below, or else % = and the inter- 
val / contains the number 7r/2). Furthermore, it is self-similar in the usual sense 
with similarity ratio p and replication number 3, so its Hausdorff dimension is also 
-log 3/ log p. 



3.4.3. Proof. We calculate all of the P.H. points of S exactly. 
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First, consider % = 1. For every integer k > 0, there are 3 fe independent cycles 
in Hi(S t ) which are born when e = p k (l/2 — p) = b k . These are the values of e 
illustrated in Figure |4| where the cycles are visible as newly formed holes in S e . These 
cycles die when e = p k a/1 /3 — p + p 2 — p 3 + p 4 = d k . Therefore S has 3 k P.H. points 
(x k ,y k ) where x k = (b k + d k )/2 = p k (b + d )/2 = p k x and y k = arcsec(4/6 fc ) = 
arcsec((io/&o) = V is independent of k. 

Next consider % = 0. For every integer k > 0, there are 2 • 3 k cycles that die 
when e = p k (l/2 — p) = d k . A canonical basis of H (S t ) is indexed by the con- 
nected components of S e . So we can see the homology classes dying since con- 
nected components are fused together at these values of e, Fig. |4j Every homol- 
ogy class is born at e = 0, so we take b k = for all k. Therefore S has 2 • 3 k 
P.H. points (x k ,y k ) where where x k = {b k + d k )/2 = d k /2 = p k (d )/2 = p k xo and 
y k = arcsec(<ifc/6fc) = arcsec(oo) = ir/2, again independent of k. 

3.5. Probabilistically Determined Structures S. Unlike the example above, the 
physically interesting structures from §1.1| we consider are determined by a proba- 
bilistic process. We state versions of self-similarity and dimension adapted to this 
situation. 

3.5.1. The P.H. Density Function. The P.H. points are drawn from a distribution 
with a P.H. density function f(x,y) which measures the expected number of P.H. 
points per unit area at (x,y). The density function / depends only on the proba- 
bilistic process giving rise to S, unlike the individual P.H. points which depend on a 
particular realization S of that process. In this case, the expected value E [-F(z)] of 
the function F(z) will be 

E[F(z)] = I I f(x,y)dxdy 

J y in I J x=z 

3.5.2. P.H. Statistically Self-Similar Processes. We call the process giving rise to S 
P.H. statistically self-similar with self-similarity dimension d if the P.H. points in the 
plane are sampled from a spatial Poisson process with P.H. density function of the 
form 

f{x,y)^x-^-g(y) 

for some function g(y), at least for x small compared to the diameter 5 of S (at 
which point f(x, y) will vanish). Here g(y), which is a function of the aspect y alone, 
encodes additional information about the shape of S unrelated to the dimension. In 
this case, 

E[F(»] w z- d 
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up to a multiplicative constant, independent of the interval /, and d will be the P.H. 
dimension almost surely. 

3.5.3. Hypothesis. We hypothesize that many physically interesting statistical struc- 
tures S that are statistically self-similar in the usual sense with similarity dimension 
d are also P.H. statistically self-similar with self-similarity dimension d. We give nu- 
merical evidence for this hypothesis in £j6] in the case of 2D self-avoiding walks and 
3D branched polymers by calculating the P.H. points of individual realizations S of 
that process (averaging to improve accuracy) and estimating F from this data. 

4. Three Geometric Structures 

An m-dimensional polymer of order n is a connected subset of K m that is the union 
of n distinct m-balls of the same radius r. We assume that the balls are non- 
overlapping, i.e. two m-balls are either disjoint, or else tangent. In this paper, 
we consider the objects drawn from three different probability distributions on the 
space of polymers. These objects are called branched polymers, Brownian trees, and 
self- avoiding walks. 

We have chosen these three structures to study for several reasons. First, they 
are all models for physical processes of interest, and they have been studied by 
mathematical physicists as representatives of interesting universality classes. Second, 
in each case the union of the balls is itself contractible with probability 1, so the fact 
that topology can yield interesting information is perhaps surprising. Third, there 
has been a theoretical breakthrough leading to an algorithm to efficiently compute 
random representatives drawn from the probability distribution in each case. 

4.1. Branched Polymers. Branched polymers are drawn from a distribution that 
can be viewed as the uniform distribution on the space of polymers. Given a polymer 
composed of n unit balls {Bi, . . . , B n } each of radius r, let xi,...,x n be their centers. 
We define the tree-type of the polymer to be the tree with vertices {1, . . . , n} and 
edges {i,j} such that Bi and Bj are tangent. There is a measure Qt on the space of 
branched polymers whose tree-type is T : 

^ t = n ^( Xi ~ x o) 

ij&T 
i<j 

where Q(xi — xj) is the area form of the unit sphere of vectors Xi — Xj. The probability 
measure on the space of all branched polymers is defined as the sum of this measure 
over each tree type, normalized to have total volume one. 
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We used an algorithm created by Kenyon and Winkler to inductively construct 
branched polymers in two and three dimensions [10J. A two-dimensional branched 



polymer of order 10,000 is depicted in Fig. l(a 



4.2. Brownian Trees. Another distribution on the space of polymers samples ob- 
jects called Brownian trees. They are built inductively - given a Brownian tree with 
n — 1 balls, another ball is placed on a circle at some distance away from it. It then 
moves via Brownian motion until it reaches the existing structure and sticks. This 
process is called diffusion limited aggregation. A Brownian tree composed of 10000 
balls is depicted in Figure l(b)[ 



To create Brownian trees, we used a program written by Mark J. Stock [13] . 

4.3. Self- Avoiding Walks. The third distribution we consider is that of self-avoiding 
(random) walks on a square lattice, which are a model for linear polymers. A self- 
avoiding walk is a subset of the edges of a lattice that forms a non-intersecting path. 
They are given the uniform distribution: if there are I self-avoiding walks of length 
n, each is given weight l/£. We realize self-avoiding walks as polymers by viewing 
each edge as three touching balls. An example of a self-avoiding walk composed of 



500,000 lattice edges is shown in figure l(c 



We used the fast-pivot algorithm created and implemented by Tom Kennedy to 
generate self-avoiding walks [S] [2]. 

4.4. The Limit. The polymers we consider are composed of finite unions of balls of 
a fixed radius and therefore are of finite complexity. If we take the limit as the size 
of the balls goes to and the number of balls increases so the expected diameter of 
the polymer remains fixed, we obtain objects of mathematical interest. One wants 
dimension estimates of these limiting objects and, in particular, we would like to 
compute their persistent homology dimensions (if they exist). 

One expects that the large scale properties of the polymers remain unchanged once 
the ball radius r is small enough, and that these reflect the properties of the lim- 
iting object. In particular, we expect the distribution of P.H. points (x, y) to be 
independent of r once the size coordinate x is large with respect to r. 

5. Computing Persistent Homology of Polymers. 



We calculate the persistent homology of a polymer by first computing nitrations of 
alpha complexes and then loading this information into the JPlex computational 
homology software package |llj . 
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The alpha complex X a of a set of points X is a simplicial complex homotopy equiv- 
alent to the union of the set of balls of radius a centered at those points. It was 
developed by Herbert Edelsbrunner and others [3] [2] [7]. The alpha complex is a 
subcomplex of the Delaunay triangulation. To be specific, it consists of all simplices 
a in the Delaunay triangulation of X such that the vertices of a lie on a sphere of 
radius < a that is empty in X. Refer to [lj for more information concerning the 
alpha complex and details about the algorithm used to compute it. 

Suppose X is a polymer composed of balls centered at points {x^ of radius r. In 
order to compute the persistent homology of this polymer, we compute a sequence 
of alpha complexes of {xi} , A r e — >■ A ei >■ . . . c — > A En , where each e« is a value at 
which a new simplex (or simplices) is added to the alpha complex and A €n contains 
every simplex in the Delaunay triangulation. This computation allows us to assign 
a filtration value to each simplex in that triangulation - the lowest value e such that 
the simplex appears in the alpha complex A e . 

The set of simplices together with their filtration values is then loaded into the 
JPlex software developed by mathematicians and computer scientists at Stanford 
University. This program computes the persistent homology of the polymer from 
this data [UJ. JPlex outputs a set of intervals (bj,dj) corresponding to the lifetime 
of a persistent homology class of the polymer during the e-expansion. This data is 
then processed to create a scatter plot of P.H. points. 



6. Computational Results. 



6.1. Numerical Determination of P.H. Statistical Self-Similarity and Es- 
timation of P.H. Dimension. We consider 2D self-avoiding walks, 2D and 3D 
branched polymers, and 2D and 3D Brownian trees. Using the methods described 
in the previous section, we compute scatter plots of P.H. points for each polymer. 



Then, we calculate the function F(x) defined in Section 3.1 and observe whether 
or not it appears to follow a power law for some interval on the x axis. If so, we 
use linear regression (taking the log of both axes so that a power law Fix) = cx~ d 
becomes a linear relation) to estimate the exponent, which will be an estimate of the 
P.H. dimension. 

Even if the probabilistic process we are modeling is P.H. statistically self-similar, we 
do not expect F(x) to follow an power law for either very high values or very low 
values of x in our individual simulations. Each simulation has two natural length 
scales: the diameter 6 of the polymer S itself, and the diameter r of an individual 
ball in the polymer. We can only expect that our estimate of F(x) is of physical 
interest if x is well above r and well below 5. The function F(x) is identically zero 
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when x > 5 ( §2.1 ) so it cannot follow a power law. Increasing the number n while 



fixing r would increase the upper end of the range of accuracy. When x is close to 
r, the features reflect the local geometry of small collections of balls, rather than 
the limiting object. Note that power law behavior begins at a small multiple of the 
component ball size in Figures [5] and [6] (r = 1 for branched polymers and Brownian 
trees, while it is 1/2 for self-avoiding walks which have additional local structure as 
they are constructed on a lattice). 



Our finite polymers do not have a non-zero P.H. dimension as defined in Section 3.1.1 
because that takes the limit as x — » 0. 




Figure 5. Log-log plots of F(x) for 3D branched polymers 



6.2. Three Dimensional Branched Polymers. Plots of F(x) for degree i — 1 
and degree i — 2 of three-dimensional branched polymers is shown in Figure [5] on 
log-log paper. The data were gathered from ten branched polymers of order 30,000. 
For both i = 1 and i = 2, the plot appears to be linear from about size ~ 4 to size 
~ 15, so the process appears to be P.H. statistically self-similar. Linear regression 
yields exponents —1.99 and —2.03, respectively. Thus the P.H. dimension of 3D 
branched polymers is about 2, which is consistent with the scaling exponent of 1/2 
that Brydges and Imbrie proved for three-dimensional branched polymers in [1] . 

6.3. Two Dimensional Branched Polymers. Here, our results are inconclusive. 
The function F(x) for 2D branched polymers appears to follow a power law with 
exponent —1.61 for a certain range. It was computed from a data set of 10 branched 



polymers of order n = 10, 000 and it is plotted in figure 6(b) However, the fit 
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(a) 2D S.A.W. using H x (b) 2D B.P. using E x 

Figure 6. Size/Features plots for 2D self-avoiding walks and 2D 
Branched Polymers 

is not as convincing as it is for 3D branched polymers and 2D self-avoiding walks. 
Furthermore, the rigorous methods of Brydges and Imbrie [I] do not work in two 
dimensions and do not provide a value for the scaling exponent of 2D branched 
polymer. 



6.4. Two Dimensional Self-avoiding Walks. A similar plot for the 2-dimensional 



self-avoiding walks and degree i = 1 is shown in figure 6(a) . The data were gathered 



from five self-avoiding walks composed of 500,000 lattice edges each. The plot ap- 
pears to follow a power law for a very long range: from size ~ 10 to size ~ 150, which 
is consistent with P.H. statistical self-similarity. Least squares regression yields an 
estimate of —1.334 for the exponent and therefore a value of 1.334 for the P.H. dimen- 
sion. This is very close to the conjectured value of 4/3 for the Hausdorff dimension, 
which was first hypothesized by Chemist P. Flory and has been proven rigorously 
for an object believed to be in the same universality class as two-dimensional self- 
avoiding walks by B. Nienhaus |12j . 



6.5. Brownian Trees. Unlike in the previous cases, the function F(x) for Brownian 
trees does not come close to following a power law, so we conclude that they are not 
P.H. statistically self-similar. The plots for one- and two-dimensional homology for 
3D Brownian trees are shown in figures 7(a) and 7(b) (a plot for 2D Brownian trees 
is similar). The data were gathered from three Brownian trees of order 30,000. 
Note that the plot is significantly bowed out, indicating that the function on a 
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(a) F(x) for degree i = 1 (b) F(x) for degree i = 2 



Figure 7. Log-log plots of F(x) for 3D Brownian trees 

log-log plot is more concave than linear. This result is expected, as Mandelbrot 
et. al. have shown with high statistical certainty that Brownian trees are not self- 
similar [5] [BJ. 



6.6. Shape Measures Involving the Aspect y. The function F(x) is not the only 
way to glean useful information from the Persistent Homology of polymers. 

Consider the bar chart in Figure [8j For each interval / of possible aspects listed on the 
horizontal axis(0 — .2, .2 — 4, etc.), the height of the bar is the ratio Pbp(I)/Pbt(I), 
where Pbp{P) is the proportion of two-dimensional branched polymer features (i = 1) 
whose aspect y is in the interval / and Pbt{I) is the proportion of two-dimensional 
Brownian tree features (i — 1) whose aspect y is in the interval J. Recall that the 
aspect y of a persistent homology class is equal to arcsec (d/b), where d and b are 
the death point and birth point of the class, respectively. This effectively measures 
the angular opening at the edge of a gulf in the structure. 

The proportion of features with low aspect measurements appearing in the branched 
polymers and the Brownian trees is similar. However, features with aspect greater 
than 1.2 are more common in branched polymers and those with aspect greater than 
1.4 are much more abundant. This makes sense when one examines the pictures 
of the branched polymer and the Brownian tree shown in Figure [TJ As described 



in §2.3[ features with a large aspect value are gulfs with the property that a large 
ball can fit inside them but only a very small ball can enter them. Intuitively, this 
difference seems to capture the fact that most features of the Brownian trees have a 
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0-.2 .2.4 .4.6 .6-.S .8-1.0 1.0-1.2 1.21.4 1.41.6 
Aspect = Arcsecant( Death/Birth) 



Figure 8. Ratio of branched polymer features to Brownian tree fea- 
tures for various aspects 

similar shape. These gaps between the branches of the Brownian tree can trap balls 
only slightly larger than those that can enter them. On the other hand, branched 
polymers have more varied gulfs. 

7. Conclusions. 

Geometric measures of shape suitable for probabilistically generated fractal struc- 
tures like branched polymers, Brownian trees, and self-avoiding walks are not so 
plentiful. These structures lack differentiability, so curvature measures aren't de- 
fined. They are contractible as topological spaces, so the usual the usual measures 
of topology (homology, homotopy, etc.) are trivial. 

The topology of the e-neighborhoods S t of S contains more useful geometric infor- 
mation about S. Via persistent homology, the topology of these neighborhoods gives 
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rise to a P.H. density function f(x,y), for each degree i, that depends on two vari- 
ables, the size x and the aspect y. The P.H. density function is a subtle measure of 
shape. 

Our theoretical observations are: 

• This measure of shape can be interpreted as how the set S can "trap" e- 
thickened objects of dimension m — i — 1, e.g. e-balls if i — m — 1. 

• The x dependency of / gives rise to a notion of fractal dimension, the P.H. 
dimension. 

• For some examples like the generalized Sierpinski triangle, the P.H. dimension 
is independent of i, and agrees with the Hausdorff dimension. 

Our main results are computational, concerning branched polymers for m = 2 or 3, 
Brownian trees for m = 2 or 3, and self-avoiding walk for m — 2. We show that: 

• Significant quantitative information about f(x,y) can be obtained by calcu- 
lating scatter plots of P.H. points for individual examples. 

• For branched polymers, m — 3, the P.H. dimension is about 2 for both % — 1 
and % = 2, which agrees with the known estimate for Hausdorff dimension. 

• For self- avoiding walk, m = 2 and % — 1, the P.H. dimension is approximately 
4/3, which agrees with its conjectured equality with the Hausdorff dimension 
of SLE 8/3. 

• For Brownian trees, the function F(x) is strictly concave on a log-log plot, 
so Brownian trees are not P.H. statistically self-similar. 

• The y dependence of f(x,y) is significantly different for branched polymers 
and Brownian trees, which quantifies visible differences between the shapes 
of the two structures. 

This paper is intended as a feasibility demonstration for a method, rather than as 
a conclusive compilation of results. We hope that the shape measure given by the 
P.H. density function will be useful in other situations with complicated geometric 
structures that are probabilistically determined. Examples we would like to see 
include spatial distribution of turbulence, defect structures in materials, foams, and 
many others. 
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